Smart Paper Notes

HOTCOLD Block: Fooling Thermal Infrared Detectors with a Novel Wearable Design

Hui Wei , Zhixiang Wang , Xuemei Jia , Yinqiang Zheng , Hao Tang , Shin'ichi Satoh , Zheng Wang

Category: Computer Vision

2022-12-12

Adversarial attacks on thermal infrared imaging expose the risk of related applications. Estimating the security of these systems is essential for safely deploying them in the real world. In many cases, realizing the attacks in the physical space requires elaborate special perturbations. These solutions are often impractical and attention-grabbing. To address the need for a physically practical and stealthy adversarial attack, we introduce HotCold Block, a novel physical attack for infrared detectors that hides persons using wearable Warming Paste and Cooling Paste. By attaching these readily available temperature-controlled materials to the body, HotCold Block evades human eyes efficiently. Moreover, unlike existing methods that build adversarial patches with complex texture and structure features, HotCold Block utilizes an SSP-oriented adversarial optimization algorithm that enables attacks with pure color blocks and explores the influence of size, shape, and position on attack performance. Extensive experimental results in both digital and physical environments demonstrate the performance of our proposed HotCold Block. Code is available at https://github.com/weihui1308/HOTCOLDBlock.

Translated by Google Translate

RDA: An Accelerated Collision-free Motion Planner for Autonomous Navigation in Cluttered Environments

Ruihua Han , Shuai Wang , Shuaijun Wang , Zeqing Zhang , Qianru Zhang , Yonina C. Eldar , Qi Hao , Jia Pan

Category: Robotics

2022-10-01

Motion planning is challenging for autonomous systems in multi-obstacle environments due to nonconvex collision avoidance constraints. Directly applying numerical solvers to these nonconvex formulations fails to exploit the constraint structures, resulting in excessive computation time. In this paper, we present an accelerated collision-free motion planner, namely the regularized dual alternating direction method of multipliers (RDADMM, or RDA for short), for the model predictive control (MPC) based motion planning problem. The proposed RDA addresses nonconvex motion planning by solving a smooth biconvex reformulation obtained via duality, and allows the collision avoidance constraints to be computed in parallel for each obstacle, which reduces computation time significantly. We validate the performance of the RDA planner through path-tracking experiments with car-like robots in simulation and real-world settings. Experimental results show that the proposed method can generate smooth collision-free trajectories with less computation time than other benchmarks and performs robustly in cluttered environments.

Physical Adversarial Attack meets Computer Vision: A Decade Survey

Hui Wei , Hao Tang , Xuemei Jia , Hanxun Yu , Zhubo Li , Zhixiang Wang , Shin'ichi Satoh , Zheng Wang

Category: Computer Vision

2022-09-30

Although Deep Neural Networks (DNNs) have achieved impressive results in computer vision, their exposed vulnerability to adversarial attacks remains a serious concern. A series of works has shown that adding elaborate perturbations to images can cause catastrophic degradation in DNN performance metrics. This phenomenon exists not only in the digital space but also in the physical space. Therefore, estimating the security of these DNN-based systems is critical for safely deploying them in the real world, especially for security-critical applications, e.g., autonomous cars, video surveillance, and medical diagnosis. In this paper, we focus on physical adversarial attacks and provide a comprehensive survey of over 150 existing papers. We first clarify the concept of the physical adversarial attack and analyze its characteristics. Then, we define the adversarial medium, which is essential to perform attacks in the physical world. Next, we present the physical adversarial attack methods in task order: classification, detection, and re-identification, and introduce their performance in solving the trilemma: effectiveness, stealthiness, and robustness. In the end, we discuss the current challenges and potential future directions.

Hierarchical Temporal Transformer for 3D Hand Pose Estimation and Action Recognition from Egocentric RGB Videos

Yilin Wen , Hao Pan , Lei Yang , Jia Pan , Taku Komura , Wenping Wang

Category: Computer Vision | Robotics

2022-09-20

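Understanding dynamic hand motions and actions is a fundamental yet challenging task due to self-occlusion and ambiguity. To address occlusion and ambiguity, we develop a transformer-based framework that exploits temporal information for robust estimation. Noticing the different temporal granularity of, and the semantic correlation between, hand pose estimation and action recognition, we build a network hierarchy with two cascaded transformer encoders: the first exploits short-term temporal cues for hand pose estimation, and the second aggregates per-frame pose and object information over a longer time span to recognize the action. Our approach achieves competitive results on two first-person hand action benchmarks, namely FPHA and H2O. Extensive ablation studies verify our design choices. We will open-source the code and data to facilitate future research.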

Delving into the Devils of Bird's-eye-view Perception: A Review, Evaluation and Recipe

Hongyang Li , Chonghao Sima , Jifeng Dai , Wenhai Wang , Lewei Lu , Huijie Wang , Enze Xie , Zhiqi Li , Hanming Deng , Hao Tian

Category: Computer Vision | Machine Learning | Robotics

2022-09-12

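Learning powerful representations in bird's-eye-view (BEV) for perception tasks is trending and drawing extensive attention from both industry and academia. Conventional approaches for most autonomous driving algorithms perform detection, segmentation, tracking, etc., in a front or perspective view. As sensor configurations become more complex, integrating multi-source information from different sensors and representing features in a unified view become vitally important. BEV perception inherits several advantages: representing surrounding scenes in BEV is intuitive and fusion-friendly, and representing objects in BEV is most desirable for subsequent modules such as planning and/or control. The core problems for BEV perception lie in (a) how to reconstruct the lost 3D information via view transformation from perspective view to BEV; (b) how to acquire ground-truth annotations in the BEV grid; (c) how to formulate the pipeline to incorporate features from different sources and views; and (d) how to adapt and generalize algorithms as sensor configurations vary across different scenarios. In this survey, we review the most recent work on BEV perception and provide an in-depth analysis of different solutions. Moreover, several systematic designs of BEV approaches from industry are described as well. Furthermore, we introduce a full suite of practical guidelines to improve the performance of BEV perception tasks, including camera, LiDAR, and fusion inputs. Finally, we point out future research directions in this area. We hope this report sheds some light on the community and encourages more research effort on BEV perception. We keep an active repository to collect the most recent work and provide a toolbox for a bag of tricks at https://github.com/openperceptionx/bevperception-survey-recipe.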

Modelling Power Consumptions for Multi-rotor UAVs

Hao Gong , Baoqi Huang , Bing Jia , Hansu Dai

Category: Robotics

2022-09-09

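Unmanned aerial vehicles (UAVs) have various advantages, but their practical applications are constrained by their limited energy. It is therefore important to manage their power consumption and to establish corresponding power consumption models. However, most existing works either establish theoretical power consumption models for fixed-wing UAVs and single-rotor UAVs, or provide heuristic power consumption models for multi-rotor UAVs without rigorous mathematical derivations. This paper aims to establish theoretical power consumption models for multi-rotor UAVs. Specifically, by leveraging the relationship between single-rotor UAVs and multi-rotor UAVs in terms of power consumption, closed-form power consumption models are derived for a multi-rotor UAV in three flight states, namely forward flight, vertical ascent, and vertical descent. On this basis, a generic flight power consumption model for the UAV in a three-dimensional (3-D) scenario is obtained. Extensive experiments conducted in real scenarios, using a DJI M210 and a mobile app built with the DJI Mobile SDK, confirm the correctness and effectiveness of these models; in addition, simulations are performed to further investigate the effect of the number of rotors on the UAV's power consumption. The proposed power consumption models not only reveal how the power consumption of multi-rotor UAVs is affected by various factors, but also pave the way for introducing other novel applications.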

Cross-Network Social User Embedding with Hybrid Differential Privacy Guarantees

Jiaqian Ren , Lei Jiang , Hao Peng , Lingjuan Lyu , Zhiwei Liu , Chaochao Chen , Jia Wu , Xu Bai , Philip S. Yu

Category: Artificial Intelligence

2022-09-04

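Integrating multiple online social networks (OSNs) has important implications for many downstream social mining tasks, such as user preference modeling, recommendation, and link prediction. Unfortunately, it is accompanied by growing privacy concerns about leaking sensitive user information. How to fully utilize the data from different online social networks while preserving user privacy remains unsolved. To this end, we propose a cross-network social user embedding framework, namely DP-CroSUE, to learn comprehensive user representations in a privacy-preserving way. We jointly consider information from partially aligned social networks with different privacy guarantees. In particular, for each heterogeneous social network, we first introduce a hybrid differential privacy notion to capture the variation of privacy expectations across heterogeneous data types. Next, to find user linkages across social networks, we perform unsupervised user-embedding-based alignment, in which the user embeddings are obtained through a heterogeneous network embedding technique. To further enhance the user embeddings, a novel cross-network GCN embedding model is designed to transfer knowledge across networks through the aligned users. Extensive experiments on three real-world datasets demonstrate that our approach yields significant improvements on the user interest prediction task, and that the resulting embeddings defend against user attribute inference attacks.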

Automating DBSCAN via Deep Reinforcement Learning

Ruitong Zhang , Hao Peng , Yingtong Dou , Jia Wu , Qingyun Sun , Jingyi Zhang , Philip S. Yu

Category: Machine Learning

2022-08-09

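DBSCAN is widely used in many scientific and engineering fields because of its simplicity and practicality. However, because its parameters are highly sensitive, the accuracy of the clustering result depends heavily on practical experience. In this paper, we first propose a novel deep reinforcement learning guided automatic DBSCAN parameter search framework, namely DRL-DBSCAN. The framework models the process of adjusting the parameter search direction by treating the clustering environment as a Markov decision process, with the aim of finding the best clustering parameters without manual assistance. DRL-DBSCAN trains a policy network with a weakly supervised reward and learns the optimal clustering parameter search policy for different feature distributions by interacting with the clusters. In addition, we present a recursive search mechanism driven by the scale of the data to handle large parameter spaces efficiently and controllably. Extensive experiments are conducted on five artificial and real-world datasets based on the four proposed working modes. The results of offline and online tasks show that DRL-DBSCAN not only consistently improves DBSCAN clustering accuracy by up to 26% and 25%, respectively, but also stably finds the dominant parameters with high computational efficiency. The code is available at https://github.com/ringbdstack/drl-dbscan.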
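
The parameter sensitivity this abstract refers to can be illustrated with a minimal sketch. This is plain DBSCAN written from the textbook definition, not the DRL-DBSCAN method; the names `dbscan`, `summarize`, and `POINTS`, the toy data, and the chosen `eps` values are all assumptions made for illustration.

```python
from math import dist

# Hypothetical toy data: two tight 2x2 squares plus one far-away outlier.
POINTS = [(0, 0), (0, 1), (1, 0), (1, 1),
          (5, 5), (5, 6), (6, 5), (6, 6),
          (10, 0)]

def dbscan(points, eps, min_pts):
    """Textbook DBSCAN: returns one label per point, -1 meaning noise."""
    labels = [None] * len(points)  # None = not visited yet

    def neighbors(i):
        # All points within eps of point i (the point itself included).
        return [j for j, q in enumerate(points) if dist(points[i], q) <= eps]

    cluster = -1
    for i in range(len(points)):
        if labels[i] is not None:
            continue
        seeds = neighbors(i)
        if len(seeds) < min_pts:
            labels[i] = -1  # provisional noise; may become a border point
            continue
        cluster += 1
        labels[i] = cluster
        queue = [j for j in seeds if j != i]
        while queue:
            j = queue.pop()
            if labels[j] == -1:
                labels[j] = cluster  # noise reachable from a core point: border
            if labels[j] is not None:
                continue  # already assigned, do not expand again
            labels[j] = cluster
            nj = neighbors(j)
            if len(nj) >= min_pts:  # j is a core point, keep growing
                queue.extend(nj)
    return labels

def summarize(labels):
    """Return (number of clusters, number of noise points)."""
    return len({l for l in labels if l != -1}), labels.count(-1)

# Same data, same min_pts, three different eps values.
results = {eps: summarize(dbscan(POINTS, eps, 3)) for eps in (0.5, 1.5, 6.0)}
for eps, (n_clusters, n_noise) in results.items():
    print(f"eps={eps}: clusters={n_clusters}, noise={n_noise}")
```

On this toy data the three `eps` values give zero, two, and one cluster respectively, which is the kind of sensitivity that motivates searching the parameters automatically rather than setting them by hand.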

Improving Policy Optimization with Generalist-Specialist Learning

Zhiwei Jia , Xuanlin Li , Zhan Ling , Shuang Liu , Yiran Wu , Hao Su

Category: Machine Learning

2022-06-26

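Generalization in deep reinforcement learning over unseen environment variations usually requires policy learning over a large set of diverse training variations. We empirically observe that an agent trained on many variations (a generalist) tends to learn faster at the beginning, yet its performance plateaus at a suboptimal level for a long time. In contrast, an agent trained on only a few variations (a specialist) can often achieve high returns under a limited computational budget. To get the best of both worlds, we propose a novel generalist-specialist training framework. Specifically, we first train a generalist on all environment variations. When it fails to improve, we launch a large population of specialists with weights cloned from the generalist, each trained to master a selected small subset of variations. We finally resume training the generalist with auxiliary rewards induced by demonstrations from all the specialists. In particular, we investigate the timing for starting specialist training and compare strategies for learning the generalist with assistance from the specialists. We show that this framework pushes the envelope of policy learning on several challenging and popular benchmarks, including Procgen, Meta-World, and ManiSkill.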

Benchmarking Node Outlier Detection on Graphs

Kay Liu , Yingtong Dou , Yue Zhao , Xueying Ding , Xiyang Hu , Ruitong Zhang , Kaize Ding , Canyu Chen , Hao Peng , Kai Shu

Category: Machine Learning

2022-06-21

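Graph outlier detection is an emerging but crucial machine learning task with numerous applications. Despite the proliferation of algorithms in recent years, the lack of a standard and unified setting for performance evaluation limits their advancement and use in real-world applications. To close this gap, we present (to the best of our knowledge) the first comprehensive unsupervised node outlier detection benchmark for graphs, called UNOD, with the following highlights: (1) evaluating 14 methods with backbones ranging from classical matrix factorization to the latest graph neural networks; (2) benchmarking method performance with different types of injected outliers and with organic outliers on real-world datasets; (3) comparing the efficiency and scalability of the algorithms by runtime and GPU memory usage on synthetic graphs at different scales. Based on the analysis of extensive experimental results, we discuss the pros and cons of current UNOD methods and point out multiple crucial and promising future research directions.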
